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I. 



INTR ODUCTION 



IQ these days of increasing costs, the Navy nust ensure 
it is recruiting and retaining the best possible personnel. 
Enlistees should not only have the capacity to do the work, 
but also the desire and drive to accomplish the assigned 
tasks. Currently, there are few measures of effectiveness 
which indicate how an enlistee will perform in the Navy. 
The SCREEN score, developed by Dr. Robert Lockman of the 
Center for Naval Analyses, predicts attrition of first term 
recruits sas a function of test score and biographic data 
Ml. It does not, however, specifically address the 
performance of a member while in the Navy; rather it 
predicts survival in the Navy. 

The advancement to S-4 is a possible measure of 
effectiveness with which a member's utility to the Navy 
could be judged. First, the member must display a certain 
amount of initiative and drive to prepare for the exam. 
Second, passing the test is an indication that a member has 
reached a basic level of knowledge within a specific rating. 
It is at this point that the Navy begins to get a return on 
its investment in the member. The S-4 test, therefore, may 
be used as a measure of both a member's capacity and desire 
to perform in the Navy. 

This thesis explores the differences between the E-4 
test passers and non-test passers based on the individual's 
biographic and demographic data. One would like to know 
what variables in a member's background are associated with 
his ability and desire to pass the exam. Specifically, an 
ordered list of determining factors is desired. Also, it is 
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important to learn if tie determining factors are identical 
for all ratings or if they vary from rating to rating. 
Finally, it would be interesting to discover how length of 
time in service and time in rate are related to a member's 
tendency to advance. 

For the purposes of this analysis, a test passer is 
defined as a member who has taken the S-4 exam and received 
a score which caused him to be advanced to E-4. k non-test 
passer is a member who has either taken the E-4 exam and did 
not receive a high enough score to be advanced or who did 
not take the exam even though he was eligible. dost of the 
non-test passers in the study are non-test takers. The 
separation of the non-test takers from the non-test passers 
was not possible due to the lack of data. The reader will 
need to keep these definitions in mind while reading the 
text. For clarity, a graphical presentation of these 
definitions may be found in Figure 1 in Chapter III. 

The data used in this study come from all personnel 
eligible to take the E-4 exam in August 1977. A member was 
considered eligible if ha had at least one year in the Navy 
and at least six months is an E-3. 

Two major results were obtained from this study. First, 
the use of non-linear discrimination resulted in correct 
classification of 60 - 70S of the members of each sample 
examined. Second, hierarchical clustering was applied to 
group the ratings (especially those with small numbers of 
members) . These groupings may have general use beyond those 
of this paper. 
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II. DI SESSION OP DATA 



There are many possible factors which aay affect, a 
fflember' s desire and ability to pass the 2-4 exaa. Examples 
of these are years nf education, job satisifaction , 
leadership of superiors, sea/shore assignment, etc. Data 
for some of the more appealing of these was unavailable for 
the study. The factors used in this analysis were all drawn 
from the August 1977 monthly Enlisted Master Hecord (HMa) 
fils located at the Naval Military Personnel Center in 
Arlington, Virginia. The month of August was chosen because 
it coincided with a month in which the E-4 exams were given. 

The SMH is a 3000 character record which is maintained 
on each active duty and reserve enlisted member of the Navy. 
Such items as social security number, pay entry base date, 
schools attended, and test scores are stored on each record. 
Each record is a condensed version of a member's personnel 
jacket. 

To make the data easier to manipulate, the 3000 
character record file was reduced to a 150 character record 
file. Each record contained the following demographic data: 



Age 

Sex 

Race 

Ethnic group 
Home of record 
Dependants 
Time in rate 

Active duty/Reserve duty indicator 
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Length of service 

End of active dnty obligation date 

Enlistment term 

lears of education 

Education certification 

A-School/No A-School indicator 

Special test scores 

SCREEN score 

Mental aptitude test scores (ASVAB or BIB) 



The special test scores measure aptitude in the 
sonar, electronics, and radio. Iha SCREEN sco 
measure of a recruit's chances of completing the 
years of his enlistment. The mental aptitude test 
measure of the member's overall intelligence. 
Test Battery (BIB) consists of five exams in the 
general intelligence, numerical reasoning, and me 
clerical, and shop aptitude. The Armed Services V 
Aptitude Battery (ASVAB) contains sixteen tests 
specific areas of intelligence and aptitude, inclu 
ones in the 3TB. Unfortunately, the ASVAB had 
given to a large enough sample of the population to 
to be used in the analysis. 



areas of 
re is a 
first two 
3 are a 
The Basic 
areas of 
chanical, 
oca tional 
measuring 
ding the 
not been 
allow it 



From the data set, the individuals wars extracted whose 
length of service was at least one year, and whose time in 
rate was at least six months. These were considered the 
members eligible to take the E-4 exam. Since completion of 
correspondence courses and practical factors is not 
reflected in the EHH, it was not possible to check these 
aligibilxty requirements. However, it was assumed that if a 
member fulfilled the time requirements and desired to taxe 
the exam, he would have his other requirements completed. 



The variables of the data were originally in three 
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scales; biua 
data in «h 
3iaclc=1, Chi 
form of the 
Interval da 
differences 
psychologica 
of the nomin 
to meet the 
educational 
responses ; 
degree, and 
no high scho 
were grouped 



ry, nominal, and interval. Nominal data are 
ich numbers are used as labels or "names”, e.g., 
cano=2, White=3. Binary data were a special 
nominal where only two responses are available, 
ta include all continuous variables where 
between scores are meaningful. For example, 
1 test data are treated as interval data. All 
al data were converted to binary values in order 
assumptions of the analysis. For instance, the 
certification was transformed from the 
no degree, GED, high school diploma. Bachelor's 
postgraduate degree, to high school diploma or 
ol diploma. The GED and college degree holders 
with the high school diploma holders. 



Some variables were not 
analysis oecause it was fo 
in the sample with values f 
sex could not be used i 
rating oecause there were n 
population. Lik.ewise, th 
oecause not enough of the s 
test. Some of the other 
oecause it was decided at t 
not be used in policy 
example of this type of var 
analysis with a descript 
listed in Appendix A. 



used in parts or all of the 
und there were not enough members 
or these variables. For example, 
n analyzing the Boiler Technician 
o female members in the sample 
a A37A3 scores could not be used 
ample population had taken this 
determining factors were not used 
he outset that the variable could 
decisions. Home of record is an 
iable. All variables used in tne 
ion of their types and ranges are 
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III. definition of groups 



Due to the non-availability of data listing all test 
takers and test passers for the August 1977 cycle, a 
comparison of the August 1977 EMH and the August 1978 BMR 
was necessary to determine who passed the exam. The E-4 
exam is given only twice a year and all test passers from a 
given cycle who are advanced to E-4 are advanced from three 
to nine months after the cycle. Therefore, if a member was 
eligible to take the exam according to the August 1977 file 
and was promoted between December 1 977 and May 1978 , he was 
classified as a test passer. If he was eligible to take the 
exam in August 1977 but he was not promoted during the 
advancement period, he was classified as a non- test passer 
regardless of whether or not he took the exam. This is 
illustrated in the Figure 1. Finally, if he was eligible to 
take the exam in August 1977 and was not in the August 1973 
file, he was listed as out of service because it was not 
possible to determine if he passed the exam or not. 
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SHADED AREA 
INCLUDES ALL 
NON-TEST PASSERS 
AS DEFINED ABOVE 




Graphical Ssprssenti tion of Test Passers and Son-test 

Passers 



Figure 1. 



Once the groups were determined, it was possible to 
evaluate the data by either of two methods. First, since 
the out of service member was in the Savy at the time the 
test was given, he was actually a test passer or a non-test 
passer. The assumption would be made that the reason he 
left the service was that he completed his time in service 
requirements. Therefore, time alone would distinguish the 
out of service member from the test passer and the non-test 
passer. If all factors involving time in the Navy such as 
length of service and time in rate are ignored, then the 
test passer and non- test passer groups may be looted on as a 
random sunset of the original test passer / non-test passer 
/ out of service data. This gives just two all inclusive 
groups as they ware found on the day of the test. 
Onf ortunatel y, one is forced to ignore military time related 
factors (variables 13, 14, and 15, Appendix A] which might 
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prove to be useful. Evea so, this approach might well be 
considered useful in evaluating enlistees who as /at have no 
time in service. 

The second method allows the inclusion of military time 
dependent factors. In this approach the data are viewed at 
the test cycle plus one year position (August 1978). From 
this time perspective, a member may fall into only one of 
the three groups; test passer, non-test passer, or out of 
service. Since all members are included in this approach 
and no assumptions of random subsets are made, the time 
dependanr factors may be used. Although classification of 
out of service personnel may at first seem extraneous to the 
problem, it is useful knowledge, and may in fact be as 
important as identifying the test passers. 

Both of the above methods of grouping may be applied at 
various levels of the Davy's structure- First they may be 
used at the all Navy level. Here all E-3 ' s who meet the 
time requirements for E-4 would be considered, regardless of 
rate. Next, the individual ratings may be investigated. 
Before a member may take an E-4 exam in a particular rating, 
he must complete certain study and practical factors 
requirements for than rating. Members in the process of 
doing rhis are refered to as designated strikers in that 
rating and are identified on the EMR file. By analyzing 
only the designated strikers in a specific rate, it may be 
possible to predict the test passers for that rats. 
Finally, for the sample available, certain ratings had too 
few designated strikers to give any conclusive results. To 
handle these cases, hierarchical clustering methods could be 
used to group ratings according to similarity of specified 
attributes of the ratings. Each group of ratings could then 
be subjected to analysis. 
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IV 



• A££BQ1Ches to a nalys is 



The data used in this analysis consisted of fifteen 
variables per mecaber. The intent vas to divide the Bembers 
into groups according to these variables. Hence, the use of 
soae form of aultivariate analysis was dictated. 



The intent of th 


necessary 


to find 


groups in 


terms of 


difference 


did e 


classification sc hem 


acceptably 


high pare 



Discriminant analy 
desired analyses. D 
statistical technique 
by which data units 
groups to which they h 
decision rules are s 
variables in the funct 
This analysis is vali 
group case. 



analysis was two-fold. First, it was 
ut if a difference existed between the 
the available data. Second, if a 
St, was it possible to obtain a 
which would correctly classify an 
tage of the laambers? 

sis is one method of performing the 
iscriminant analysis is a multivariate 
used for constructing decision rules 
(enlisted members) may be assigned to 
ave the greatest resemnlance. These 
tatistical functions. The independent 
ions are the attributes of the member, 
d for both the two group and the three 



There are three basic assumptions underlying 

discriminant analysis J 2J . First, the groups must be 
discrete and identifibla. Both grouping methods defined in 
the last section meet this requirement. Second, each 

observation in each group can be described by a set of a 

variables. This condition is met oy use of the fifteen 

determining factors. Finally, all a variables are assumed 
to have a aultivariate normal distr ioution. All of the 
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iatsrvai data for this study were unimodal aud symmetric. 
The data were judged to have a close enough approximation to 
a normal distribution to meet this assumption. The binary 
data, of course did not. However, according to Gilbert |3|, 
binary data may be used with little loss of discriminating 
power. This is because the rotations of the axes which 
occur in discriminant analysis cause the binary data to 
appear to be binary no longer. 

As an example, consider a two space problem of binary 
vs. interval data: 




Binary vs. Interval Data 
Figure 2. 



If 

function 
interval 
f ollowin 



the axis is rotated 
, the data no longer 
, but interval vs. 
g figure. 



to obtain 
appear to 
interval as 



a discriminant 
be binary vs. 
shown in the 
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Iransformed Binary vs. Interval Data 
Figure 3. 



Given that all the basic assumptions for 
discriminant analysis are satisified, the 
discriminant function must be chosen, 
variance-covariance matrices of the groups are 
linear function may be used. If they are no 
better discrimination is achieved by a quadratic 
A two-dimensional illustration of this is s 
following diagram. The inequality of the 
matrices is reflected by the elliptical contours 
density. 



the use of 
type of 
I f the 

equal, a 
t equal the 
function, 
hown in the 
covariance 
of constant 
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Quadratic vs, Liaear Discriminating Functions 

Figure 4. 



The quadratic function above misclassif ias only one data 
point, wkereas the linear function mi sclassif ies five 
points. When given a choice, even if the 
variance-covariance matrices are equal, it is wise to choose 
the quadratic function. The quadratic should always perform 
at least as well as the linear function. 

As was stated in Chapter III, some ratings do not have a 
large enough sample in any testing cycle to allow one to 
conduct a meaningful analysis. To resolve this problem, 
cluster analysis was used to group the ratings. 

There are several methods of cluster analysis available. 
The particular type used in this study is termed 
hierarchical clustering. For this method, the data 
attributes of each entity (rating) are investigated to 
determine which two entities are the most alike 1,4]. These 
two entities are clustered together to form a new entity and 
an averaged set of variables is computed for the newly 
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formed claster. This cycle is repeated until there is one 
final cluster containing all of the original entities. The 
results can be represented as a tree with all the original 
units on the left and the single cluster on the right. 



Hierarchical Clustering Tree 
Figure 5. 



The major advantage to the hierarchical clustering 
methods is that they allow one to view the overall 
relationships among the data units. The tree diagram* shows 
the natural clusters which occur and the degree of 
similarity at the end of each clustering cycle. From this, 
the analyst is free to choose the optimal number of clusters 
for his purposes. 
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V. TECH NICA L DEVILS OF PROCEDURES USED 



Discriminant analysis was applied to the data to 
determine if a difference existed between the group means 
and, if so, to construct decision rules to place members in 
the proper groups. 

The first step in the analysis was to determine if the 
variance-covariance matrices of the groups were equal. The 
results of this dictated whether or not to use a linear or 
quadratic discriminating function. In this analysis, every 
sample produced unequal variance-covariance matrices. 
Therefore, the quadratic function was used in every case. 

Next, the data from each group were investigated to 
determine if a difference existed between the means of the 
groups in terms of the variables being investigated. The 
derivation of the two group case is presented in the 
following paragraphs. The three group case is found in 
Appendix B. 

For this analysis, the groups were chosen to be of equal 

size (N = N = N ) and the variance-co variance matrices are 
1 2 

assumed to be unequal. N observations from each of the test 

passer and non-test passer groups may be represented in 

th 

vector notation as follows, where the n observation from 
each group is a 1 xm vector 

X - ' ... ,x ) 

n 1 , n m, n 
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i 




I 



where n=1, ... /N and a is the number of variables in each 

observation. p and p are also vectors of length m. 



The null hypothesis for this section of the analysis is that 
the means of the groups in terms of the the ietermining 
factors are equal or 



a : p = p 

0 1 2 



The standard method in univariate statistics for 
determining if the mean of a population with unknown 
variance is equal to a specific value is the t-test. Given 

2 

that the population has a p, a ) distribution, the t 

statistic is defined as follows 



t = (v/N (x-p) ) /s 



The extsntion of the t statistic to multivariate analysis is 
2 

the T statistic whicn is defined as 



2 -1 
T = N (x-p) *S (x-p) 



Now let y = X -X and N = N +N . y is defined as 
i i, 1 i,2 12 



y = 1/N Z (X -X ) = X -X 

i,1 i,2 1 2 
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and the variancs-covar ia nca matrix is 



s = 1/ (N-1) (y ,-y) ' 

1 X 



= 1/ (xN-1) Y. 



X -X +X ) (X -X -X +x ) ' 

i,2 1 2 i,^ i,2 1 2 



2 

The T statistic in terms of y is 



2 -1 
r = Ny 'S y 



and the null hypothesis as stated above can be rewritten as 



H : u - n = 0 

0 1 2 



2 

to accomodate T in terms of y. 



The object of 
about ji and 



the 



2 

T test is to rind a confidence region 
as shown below. The size of the 



confidence 



region is dependent on a 



the level of 



significance of the test. 
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Confidence Region About u - y 

1 2 

Figure 6. 

2 

With a scaling change, the T statistic is distributed 

according to an F distribution with a and N-m-1 degrees of 
freedom 1 5 1 . 



2 

T 

M 






m 

f ( a) 
N~ m- 1 



Therefore, if 



2 

T 

N 



pF^Tli 



a 

F (a) 
N-m-1 



the hypothesis that the means are equal is rejected. 

I 

Once the inequality of the means has been established, 
the decision rules can be constructed. Again, the two group 
case will be dealt with here and the three group case can be 
found in Appendix C. 
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Anderson states that in addition to the basic 
discriminant function, the classification function must taice 
into account the a priori probability of group membership 
and/or the costs of misclassification. Furthermore, he 
states that, "the good classification scheme minimizes the 
bad effects of misclassification" ioj. As a means of 
exploring this, let M be the misclassification function 
which is to be minimized. 



a = P(1 I 2) + ? (2 1 1) 7T 



IT is the a priori probability of an observation being 
h 

drawn from group h, and P{g|h) is the probability of 

classifying an observation as a member of group g when it is 
actually a member of group h. The oosts of 
misclassification could be included but are unknown and are 
nor dealt with in this study. The quantity d is minimized 
by the following rule: assign to group one if 

f (X) /f (X) > TT / TT 

1 2 - 2 1 



Otherwise, assign to group two. f (x) is the probability 

i 

density function of population i. Assuming aultivariare 

normal populations with unequal variance-covariance 
matrices, the above equation becomes 



1/2 1/2 -1 -1 
C(2Tr) Cz^l 1 exp-1/2C(X-y^) • ^ 

T 72 1/2 -1 I 1 

1 exp-l/2j.(X-u^) • ^ ’'l 



where Z is the variance. Talcing natural logs of both sides 
and simplifying yields 
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-1 

1/2 !l-1/2l.(X- ' 



-1 - 1 
(Xni 



3y rearranging the equation, the quadratic decision rule is 
formed. 



-1 -1 -1 
X' (E^ - E^ ) X - 2(u^' E^ 



-1 



2 2 



-1 -1 
E ) X + u' E 



1 1 



1^. ~ 1 ^ M 



-1 

M 

2 2 2 



< In r s . E 1 - 21nl it /it 1 

-21 21 



If the above equation is true, the observation should be 
assigned to group 1. If not, assign the observation to 
group 2. 



It is also possible to determine the contribution that 
each variable malces to the discriminant function. That is, 
it is possible to compute the relative amount of 
discriminatory power that each vaxiabie has. 



th 

Let V be an axl vector of coefficients of the i 

j 

discriminating function. This vector is transformed into a 



scaled vector V by multiplying each element of v by the 

i i 

square root of the corresponding diagonal element of tae 

pooled within groups deviation sum of squares matrix , 
where 



W = (X - X ) (x - X ) 

i,i gin gi gin gi 
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The scaled vector V’ is 



V 

i 




W ,V V , 

1,1 i,2 2,2 



< 7 . w -j 
X , n m , m 



The discrimination power of each variable can now be 
expressed as 












The method of hierarchical clustering used in this study 
is due to Ward t6g . This method is based on the computation 
of the Euclidean distance between centroids of the entities. 

th 

For this discussion, let x be the value of the i 

ijlc 

th th 

variable of the j data unit in the Ic cluster. The mean 

th th 

of the i variable in the Ic cluster is 



X = 1/m X 
ik k ijk 



and the error sum of squares for cluster k is 



E = (X - X ) 

k ijk ik 



The total within group error sum of squares is 
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The criteria for the Sard method is the ainiiaization of 
the increase in the withia group error sum of squares. 

ae = e - e - e 

pq t p q 

where E and E are the within group error sum of squares 

P q 

for the two entities which are joined to form the cluster t. 
3y simplification 



2 

AS = m m (X -X ) 

p q 

This function should force the entities to group into tight 
clusters. Also, this should cause the distance between the 



clusters to be at a maximum 



VI. RESULTS OF ANALYSIS 



The discr ininant anal/sis discussed in this section was 
done with the aid of computer routines appearing in 
Eisanheis and Avery [2). These routines use a complete 
stepwise procedure to guarantee the best discriminant 
function possible from the given data. This procedure 
investigates every possible comnination of determining 
factors beginning with a subset of one and continuing up to 
m factors 123 . For each subset the best combination of 
variables is chosen. When the analysis is complete, the 
analyst is free to choose the subset with the bast function 
for his purposes. In this study, all m variables were used 
in every case. 

The analysis was conducted on six samples. The first 
was at the all Navy level where eligible members were 
randomly selected without regard to whether they were 
designated strikers. Next, the ratings in each of the 
Navy's general assignment categories (Seamen, Firemen, 
Airmen, and Constr uctionmen) were subjected to cluster 
analysis to find which rates tended to be most alike. These 
clusters were formed using computer routines designed by 
Anderberg 14|. The variables used to determine the 
clustering were the average individual ASVAS test scores 
from the August 1978 cohort. These variables were chosen 
because they normally remain reasonably constant over a 
member's career. The averages were computed by summing the 
test scores (separately for each subtest) for every E-4 
through S-6 in sach rate and dividing by the number in the 
rate. Three of the clusters obtained from the Seamen 
category were subjected to discriminant analysis. The last 
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two sample popU|latioQS to be analyzed were individual 
ratings: the Boiler Technicians and the Machinist’s Mates. 

The following diagram depicts the analyses which were done. 
The reader may find it helpful to refer to this while 
reading the suceeding material. 



ALL NAVY 

2 3 

GROUP GROUP 



SN CLSTR 1 


SN CLSTR 2 


SN CLSTR 4 


2 3 


2 


3 


2 3 


GROUP GROUP 


GROUP 


GROUP 


GROUP GROUP 


BOILER 


TECH 


MACHINIST'S MATE 


2 


3 


2 


3 


GROUP 


GROUP 


GROUP 


GROUP 



Discriminant Analyses Conducted in this Study 

Figure 7. 

For the ail Navy sample, 300 members of each group, 
i. e., test passers, non-test passers, and out of service 
personnel were randomly selected from the eligible pool. 
First, the two group case, test passers vs. non-test passers 
was analyzed. After checking the variance-covariance 
matrices and finding them to be unegual, a comparison of tae 
group means was made. The test for equality of group means 
produced an F statistic of 3.83 which rejected the null 
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hypothesis at tha 0.01 Level of significance (see Appendix 
D) . The variables in this sample which provided the 
greatest power of discrimination were race {2'-i.5%) , high 
school diploma (20.3%)/ and dependents (13.59%). The 
analysis indicated Caucasian members had a higher 
probability of passing the exam then their non-Caucasian 
counterparts. Individuals having a high school diploma 
and/or having dependents also were more likely to pass the 
exam. A complete list of the determining factors and the 
amount of discriminating power for each sample population is 
listed in Appendix S» 

Once the discriminant function was computed, tne 

classification of each member was done using a quadratic 
function as described in Chapter V. The rate of "hits” or 
correct classification for the all Jlavy two group sample was 
62% (see Appendix H) . This classification rate includes the 
effects of the a priori probability of group membership as 
do all other classification rates in this study. 

The three group case for the all Navy sample was also 
investigated. Here the equality of the group means was 
rejected at the 0.01 level of significance. In this case 
the length of time remaining in service (SAGS), 32.0%, and 
length of service, 12.27%, proved to be the most powerful 
determining factors. 3oth of these variables discriminated 
best between the out of service group and the other two 
groups. This is probably because a member with one year or 
less remaining in his obligation is not likely to take the 
exam if he is planning to leave the service, because the 
rewards he will reap are not worth the effort to prepare for 
the exam. In this sample the rate of hits was 63.0%, which 
was sligatly better than the two group case. 

Clustering analysis of the seamen rates resulted in four 
clusters (see Appendix F) . The ratings tended to group by 
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job skills; the elactroaics rates in one cluster, the 
administrative rates in another, etc. Three of the clusters 
were subjected to discriminant analysis. The first cluster 
investigated was cluster four in Appendix F. The test for 
equality of group means generated an F statistic of 1.99 
which was significant at the 2.6X level. The largest 
discriminating variables in this sample were years of 
education (21.77%), race (15.0%), the numerical reasoning 
section of the BTB (13.7%), and the clerical aptitude 
section of the BTB (10.46%). For this sample 87% of the 
observations were correctly classified. 

Cluster four was also subjected to the three group 
discriminant analysis. Eiere the F statistic was 5.25 and 

-30 

was significant at the 10 level. As in the all Mavy 

group, length of time remaining in service was the largest 
discriminator (29%) , but was followed by the BTB test for 
shop aptitude (12.09%) and time in service (8.24%). The 
three group sample was correctly classified 59.0% of the 
time. 

The Seamen rating cluster two, two group sample, was 
analyzed next. The test for equality of group means was 
significant at the 12% level, indicating that the means of 
the groups were close to one another. Since the means were 
so close, the discriminating variables and the 
classification rates should be viewed with care. 

The three group case of the Seamen cluster two did not 
test as having equal means and classified 63.3% of the 
observations correctly. Even so, only 43% of the non-test 
passers were correctly classified. 

The Seamen cluster one consisted mostly of electronics 
ratings. The two group test of equality of group means 
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produced an ? statistic which was significant at the 4526 
level. This is hard to accept until one realizes the entry 
requirements for these rates are very specific. Host 
entrants have a high school diploma, at least twelve years 
of school, have gone to A-school, etc. In short, the 
cluster is s c homogeneous in terms of the available data 
that it was not possible to discriminate between the groups. 

The three group case faired somewhat better as with the 
other clusters. The test for equality of the means was 
rejected and the classification was correct 69.0% of the 
time. However, it must be noted that only 38% of the test 
passers were correctly classified. 

The test of group means for the Boiler Technicians two 
group sample resulted in an F statistic which was 
significant at 94%. Because of this, the classification 
results should be used with care. Sven though the three 
group analysis showed the means were not equal, the 
percentage of correct classifications was only 50.0%. Again 
it was this good only because of the addition of the out of 
service group. The distinction between the test passers and 
non-test passers was not very good, with only 23.0% of the 
non-test passers being correctly identified. 

The Machinist’s Mata rating two group case produced 
surprising results when the test of the equality of the 
group means produced an F statistic which was significant at 
0.36%. Considering the results of the other samples, one 
would have expected the significance to be much larger. The 
most powerful determining factors for this case were the 3T3 
tests for general intelligence (29.91%) and mechanical 
aptitude (26.44%) . Of the total sample, 66% was classified 
correctly. 

The three group test produced an P statistic which was 
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-26 

significant at 10 . This sample stowed time remaining in 

the Navy to be the most powerful determining factor at 
18.05%. Of the total sample, 66% was correctly identified 
using the decision rules. 

From the above results it can be seen that the means in 
most of the two group samples are too close together to 
permit any distinct discrimination. This is also true for 
the three group samples because, although the out of service 
group is distinct enough to cause the equality of group 
means test to fail, the test passer and non-test passer 
groups remain as indistinct as before. The fact that the 
all Navy sample tasted as not having equal means, while all 
other samples did, appears to be due to the variability of 
the data in the ail Navy case. 
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VII. CONCLOSIONS 



Discrimiriaat anal 7 ses and clustac analyses were used in 
this study as tools to attempt to determine if there was a 
difference between B-4 test passers and non-test passers in 
terms of the available biographic data. 

The study has shown with a high level of certainty that 
the available data do not differentiate sharply between the 
groups. It would appear that the entrance regui cements for 
the Mavy or the specific eating create a homogeneous group 
of personnel in terms of these variables. Since there 
appears to be nothing in the general biographic background 
of a member to distinguish between the groups, other 
possible explanitory variables might be inspected. For 
instance, where was a member's last duty station, what type 
of leaders did he have, was he training in a rating he 
liked? 

Although the discriminating variables were not found in 
the study, the use of the E-4 exam as a measure of 
effectiveness of a member's utility to the Mavy still merits 
study. Sharper discrimination might result if data 
reflecting actual test takers is used. 
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APPENDIX A 



LIST OF DETEHMINING FACTORS 



FACTOR 



TYPE 



RANGE 



1. Dependants 



Binary 



0-No dependents 
1 - Dependents 



2. Age 

3. Race 

4. Sex 



Interval 

Binary 

Binary 



b. A-School Binary 

6. Years of education Interval 

7. Education Binary 

cert ification 



3. BTB (Gen. Intel.) Interval 

9. BTB (Nuflj. Reason.) Interval 



19-3 3 

0- Non-Caucasian 
1 - Caucasian 

0 - Female 

1 - Male 

0 - No A-School 

1 - A-School 

9-16 

0 - No high 
school diploma 

1 - High school 
diploma 

31- 74 

32- 69 
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10. BTB (dech. kpt.) 


Interval 


37-70 


11. BTB (Clr. Apt.) 


Interval 


38-76 


12. BTB (Shop Apt. ) 


I nterval 


37-74 


13. Time in rata 


Interval 


6-59 mos 


14. Time remaining 


I nterval 


0-48 OIOS 


15. Length of service 


Interval 


12-72 mos 
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APPENDIX B 



K GSOOP TEST FOR BQOALITY OF GROaP MEANS 



0 12 k 



This implied u - u = 0 for 1 ^ i £ j £ k 

i j 



Let 

0 ) ( 2 ) = 



(D ( 1 ) = 

/2 + ^ /2 
1 2 




(j4k— 1 ) 



y/(k-1) *■ p/(k-1) + ...-U =0 

1 k- 1 



Now a : 1 yo){i) = 0 

0 ]c=T^ 



ahich is the form of Anderson's test of the hypothesis 



a 

0 



.X . 

1 1 



o 



Let y = Sx +Bx ■►...+6x 

i 1if1 2i,2 ki,k 



|5l. 
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s 



i 



then y = 



e X 

i i 



and 



s = 1 (y.-y) (y. -y) • 

3-T 1 1 

2 -1 
and T = N (y- ) *S (y- ) 



if 

N-ia-1) > f“* ( oj 

J}-2) ai a-m-l 



then reject the equality of group means. 
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APPENDIX C 



K GROUP GLASSIFICATION FUNCTION 



Assign to group g if for all other groups h: 



f ( x) /f (x) h— /k/h — g 

g h ~ h g 



Assign to group g if for all other groups h: 



-1 -1 -1 -1 -1 -1 

(E -E )X-2(u’2: -u 'E )X + y*E \i ~ u'E u. 

g h gg hh ggghhn 



-1 

> Inf E . E J - 21nf 77 / it1 

~ g ^ g 
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APPENDIX D 





TESTS OF 


EQUALITT 


OF GSOOP MEANS 






EQaALITY OF 


7AHIANCS 


EQUALITY 0? 


MEANS 




F STATISTIC 




F STATISTIC 


LEV 


All Navy 
(2 Group) 


1.78 


0.0 


3. 83 


0. 


All Navy 
(3 Group) 


2.34 


0.0 


16.69 


0. 


SN Clsrr 1 
(2 Group) 


1 .88 


0.0 


.99 


45. 


3N Clstr 1 
(3 Group) 


2.07 


0.0 


a. 15 


0. 


SN Cls-tr 2 
(2 Group) 


1.4 1 


.87 


1 . 49 


1 2. 


3N cisrr 2 
(3 Group) 


1.75 


0.0 


3. 14 


0. 


SN Clstr 4 
(2 Group) 


1.98 


0.0 


1.99 


2. 


SN Clstr 4 
(3 Group) 


2.06 


0.0 


6. 25 


0. 


BT 

(2 Group) 


2.08 


0.0 


. 4 1 


94. 



7BL 

.0 

. 0 

I. 29 

1.0 

36 

>.Q 

:.56 

I. 0 

■ .70 
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1 



BT 

(3 Group) 
MM 

(2 Group) 
MM 

(3 Group) 



1.78 



1.90 



2 . 10 



0.0 



0.0 



0.0 



4.46 



2. 61 



7.43 



0 . 0 
. 36 
0.0 
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APPENDIX E 



2 GROUP 

Race 

Sdu. Cert. 
Dependents 
Age 

BTB (Clr. Apt.) 

BTB (Nua. Reason.) 
BTB (flech. Apt.) 
A-Schooi 
BTB (Gen Intel.) 
Years of edu. 

BTB (Shop Apt.) 

Sex 



POaSa OE DETERdlNING FACTORS 



ALL NA7Y 



3 GROUP 



24.50% 


Time remaining 


32.03% 


20.30% 


Time in service 


12.27% 


13.59% 


Dependent s 


7.84% 


10.55% 


Sex 


7.45% 


a. 47% 


Edu. Cert. 


6.99% 


5.58% 


Time in rate 


6.46% 


5.28% 


BTB (Shop Apt.) 


6.04% 


4.69% 


BTB (dech. Apt.) 


4.85% 


4.59% 


A-School 


3.98% 


1.57% 


BTB (Clr. Apt.) 


3.04% 


0. 48% 


Race 


2.43% 


0.35% 


Age 


2.05% 




Years of edu. 


1.67% 




BTB (Gen. Intel.) 


1.49% 




BTB (Num. Reas.) 


1.35% 
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fT^iTu(T^ fflll 



SEAHEN GEOaP 1 



2 GSOOP 3 GEOUP 



BTB (Shop Apt.) 


29.43S 


Time remaining 


35.72% 


3TB (Clr. Apt) 


17. 83? 


BTB (Gen. Intel.) 


10.95% 


BTB (Hunt. Eeason.) 


14.32% 


BTB (Shop Apt.) 


8.82% 


BTB (Gen. Intel.) 


8.55% 


Age 


8.19% 


BTB (Mech. Apt.) 


6.58% 


Time in rate 


6.21% 


Dependents 


6.07% 


BTB (Clr. Apt.) 


6.17% 


A-School 


5. 14% 


BTB (dech. Apt.) 


4.89% 


Age 


4.23% 


Time in service 


3.54% 


Race 


3. 17% 


A-School 


3.46% 


Sdn. Cert. 


3. 14% 


Dependents 


3.45% 


Irs of edu. 


1.33% 


Edu. cert. 


3.38% 






BTB (Hum. Season.) 


2.13% 






Yrs of edu. 


1.93% 






Race 


1.03% 
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SEAMEN 



Gfioap 2 



2 GROaP 3 GROUP 



A-School 


17.30^ 


Tiae remaining 


37.68% 


BT8 (Mech. Apt.) 


14.98% 


A-School 


6.76% 


flTB (Clc. Apt.) 


13.35% 


Time in rate 


6.69% 


BT B (Shop Apt.) 


11. 26% 


Dependents 


6.24% 


l£S of edu. 


9.62% 


BTB (N um. Reason . ) 


5.76% 


Edu. cert. 


9. 1 7% 


Edu. cert. 


5.54% 


Sex 


7. 14% 


Yr of edu. 


5.42% 


3TB (Gea. Intel.) 


4.35% 


Time in service 


5.22% 


BT3 (Num. Reason.) 


4.73% 


Race 


4. 52% 


Age 


2.81% 


BTB (Shop Apt.) 


3.99% 


Dependents 


2.40% 


BTB (Mech Apt.) 


3.57% 


Race 


1.30% 


BTB (Clr. Apt.) 


3.49% 






BTB (Gen. Intel. ) 


2.50% 






Sex 


1.39% 






Age 


1.05% 



43 



SEAMEN GROUP 4 



2 GROUP 



years of edu . 


21.77S 


Race 


15.04% 


BTB (Hum. Season.) 


13.70% 


BTB (Clr. Apt.) 


10.46% 


BTB (Shop Apt.) 


10.31% 


Sex 


6. 13% 


Bdu. cert. 


6. 01% 


Dependents 


4.52% 


BTB (Gen. Intel.) 


3.41% 


A- School 


3. 32% 


Age 


2.39% 


BTB (Seen. Apt,) 


2.39% 



3 GROUP 



Time remaining 


29.27% 


BTB (Shop Apt.) 


12.09% 


Time in service 


8.24% 


Dependents 


7.94% 


Edu. cert. 


7. 14% 


BTB (Gen. Intel.) 


6.67% 


A-School 


4.39% 


Years of edu. 


4.15% 


BTB (Nua. Reason.) 


4.06% 


Sex 


3.97% 


Race 


3.94% 


Time in rate 


2.58% 


BTB (Mech Apt.) 


1.96% 


BTB (Clr. Apt.) 


1.79% 


Age 


1.74% 
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BOILER TECHNICIAN 



2 GROOP 




3 GROUP 




BTB (Gea. Intel.) 


18. 1 


Time remaining 


18.05% 


BTB (Nua. Reason.) 


16.96* 


Years of edu. 


15.39% 


BTB (Clr. Apt.) 


15.62* 


BTB (Shop Apt.) 


12.84% 


Years of edu. 


12.32* 


Time in service 


10.73% 


Edu. cert. 


10.30% 


Age 


10.46% 


A-School 


9.43% 


BTB (aech. Apt.) 


10.25% 


STB (Shop Apt.) 


6. 99% 


Edu. cert. 


7.46% 


Dependents 


4.39% 


A-School 


3.23% 


3T3 (aech. Apt.) 


4.05% 


BTB (Gen. Intel.) 


3. 13% 


Age 


1.42% 


Time in rate 


2.9 2% 


Race 


0.35% 


BTB (Num. Reason.) 


2.19% 






Dependents 


1.39% 






Race 


1.30% 






BTB (Clr. Apt.) 


0.58% 
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m 



% 

I 






JiaCHIUIST’S 3ATE 





2 Gsoap 




3 GSOOP 




BIB (Gen. 


In tel. ) 


29.91 S 


Time remaining 


29.56% 


BT3 (Mech 


. Apt.) 


26.44% 


BTB (Mech. Apr.) 


15 .55% 


Age 




10.01% 


Edu. Cert. 


11.17% 


BIB (Hum. 


Season. ) 


9.68% 


Time in service 


8.30% 


BTB (Clr . 


Apt.) 


7.97% 


BTB (Gan. Intel.) 


6.9 1% 


Dependent 


s 


5.15% 


Time in rate 


6.26% 


Years or 


edu. 


5. 0 3% 


BTB {Clr. Apt.) 


5.29% 


A-School 




2.41% 


Age 


4.65% 


Race 




1.70% 


Years of edu. 


3.88% 


Edu. cart 


. 


1.53% 


Race 


3.05% 


BTB {Shop 


Apt.) 


0. 12% 


BTB {Shop Apt.) 


1.30% 








BTB {Num. Reason.) 


1.65% 








A-School 


1.62% 








Dependenrs 


0.25% 
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APPENDIX F 



EESULIS OP CLUSTERING OF SSAaSN RATINGS 





CLUSTER 1 




CLUSTER 2 


STS - 


Sonar Tach (S) 


SK - 


Storekeeper 


STG -- 


Sonar Tech (G) 


PC 


Postal Clerk 


FTG - 


Fire Cntl Tech (gun) 


IN - 


Yeoaan 


FTM - 


Fire Cntl Tech (aissle) 


DK - 


Dispursing Clerk 


FTB - 


Fire Cntl Tech (FBrt) 


?!S - 


Mess Specialist 


ETN - 


Electronics Tech {comm) 


sa - 


Ships serviceman 


ET3 - 


Electronics Tech (radar) 


LI - 


Lith ographer 


EW - 


Elec Narfare Tech 


CTA - 


Comm Tech (admin) 


CTM - 


Comoi Tech (maint.) 


TM - 


Torpedoman's Nate 


CTI “ 


Comoi Tech (intarp. ) 


3M - 


Boatswain's Nate 


DS - 


Data Systeas Tech 






lIT - 


aissle Tech 







47 






i 



CLUSTER 3 



CLOSTEH 4 



GMG - 


Guaner’s Bate 


(gun) 


CTT - 


Comm Tech (tech) 


GSH - 


Gunner’s Bate 


( missies) 


CTO - 


Comm Tech (comm) 


GMT - 


Gunner’s Mate 


(tech.) 


CTH - 


Comm Tech (collection) 


MN - 


ainesan 




DP - 


Data Processing Tech 


Ofl - 


Opticalnan 




IS - 


Intelligence Spec 


SA - 


aaster at Arms 




PN - 


Personnelman 


la - 


Instrument man 




aa - 


Eadioman 








DM - 


Draftsman 








QM - 


Quartermaster 








OS - 


Operations Spec 








OT - 


Ocean Sys Spec 








MU - 


Musician 








JO - 


Journalist 








LN - 


Legalman 
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APPENDIX G 



CLASSIFICATION RESaLTS 



SAaPLE 




% COaHECTLY CLASSIFIED 








TEST 


NON TEST 


OOT OF 




TOTAL 


PASSERS 


PASSERS 


5SE7IC; 


All Navy 
{2 Group) 


62% 


80% 


45% 


-- 


All Navy 
(3 Group) 


63% 


77% 


39% 


7 1?o 


SN Clstr 1 
(2 Group) 


74% 


32% 


38% 


— 


SN Clstr 1 
(3 Group) 


69% 


38% 


79% 


12 % 


SN Clstr 2 
(2 Group) 


63% 


77% 


49% 


— 


SN clstr 2 
{3 Group) 


6 3% 


71% 


43% 


16 % 


SN Clstr 4 
(2 Group) 


67% 


81 % 


50% 


— 


SN Clstr 4 
(3 Group) 


69% 


30% 


42% 


11 % 


3T 

(2 Group) 


60% 


86% 


34% 


-- 
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3T 51 % 

(3 Group) 

aa 66X 

(2 Group) 

aa 55 % 

(3 Group) 



80S 



71 % 

52% 



28% 

60% 

58% 



60% 



77% 
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APPENDIX H 



CiaSTERING THEE FOR SEAMEN RATING 



seamen 3ATIMGS 



ITSM MAM£ 

TE''!- S 

FI-£ CN^L 'TECH G 
TECH G 

FFE CMTL m 

ELEC WARFARE TECH 
yiSSLE TECH 

CCVM T £Ch ^ 

ELEC^PCMCS TECH N 
ELECTRCMICS TECH R 
FI-E CMTU TECH 8 
DATA $>S TECH 
TECH I 

GHN'IER* S ••^3'^E G 
yi NS MA N 

Qj^v.cr IS mate M 
GUN'jER’S MA'E T 
IN ST^CME.NTHAN 
>T c R AT ARM S 
CF'I C ALH 
COMH TECH T 
FER5 C^'NELM an 
CGVM '6CH R 

RTC^ESS TECH 

INTELL spec 

TECH c 

s IG'^L “IAN 

RACiC‘"AN 
uRAr TS'-IAN 
C’J ^RT PRM^ISTER 
CFERA'IONS SPEC 
OCEAN SYS TECH 
'^■JSlCi an 
JGURNA LHAN 
LEGALH '-H 
S^CREKEEPEP 
STAL C LERK 
TECH A 
Y etm i'< 

01 St si ng clerk 

AT ^ IM » S MA T£ 

LI TH CGFAPHEP 
MESS SPEC 
SHiP^S serviceman 
"'CFPeDCyA^* S VATE 

IT EM M A»i e 
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NOTE: 
FINAL 2 
JOIN AT 



BRANCHES 
CLASS 25 



14 15 



1 
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StAMEN 9ATi‘iGS 3-78 

THIS ~UN OEoiriS THE PORTION OF THE ’’REE GENERATED BETHEcM STAGE 1 AMO STAGE. A3 
•'HE CPrERICJ VALUES APE SEGMENTED INTO THE FnuLOWING CLASSES. 



CLASS 


SOUND 


UPPER 6CUNC 


i 


0 -149 ?26fi7E- 


■03 


0. 19742124E- 


•01 




0 .197‘^2124E- 


‘0 1 


0.39334521E- 


•01 


3 


0- 39334521E- 


•Cl 


C. 5692691 0E- 


■01 




0 .50926910E- 


01 


C. 76519205E- 


■01 


5 


O.70f:i.9205E- 


■01 


O.90111629E- 


•01 


6 


Q.9H111629E- 


01 


C.11 770397E 


00 


7 


0.1177C397E 


00 


0. 1372963 2E 


oc 


3 


Q. 13729632E 


00 


O.156030S6E 


00 


9 


O.lffc^f f6oE 


00 


C- 176401C1E 


oc 


LO 


0 •176401O1E 


00 


0.19607335E 


oc 


11 


C. 1 9607335 E 


00 


0.21566570E 


00 


12 


0 •21566570E 


uo 


C. 23525604E 


oc 


L3 


O.235250O4E 


00 


0. 2540503 9E 


00 




0. 2f4^5C39E 


00 


C. 27444273E 


00 


15 


0 •27444<i73E 


00 


C. 294035C3E 


oc 


16 


0.29'i03509 E 


00 


0.31362742E 


00 
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